Automatic Text Simplification for Spanish: Comparative Evaluation of Various Simplification Strategies
نویسندگان
چکیده
In this paper, we explore statistical machine translation (SMT) approaches to automatic text simplification (ATS) for Spanish. First, we compare the performances of the standard phrase-based (PB) and hierarchical (HIERO) SMT models in this specific task. In both cases, we build two models, one using the TS corpus with “light” simplifications and the other using the TS corpus with “heavy” simplifications. Next, we compare the two best systems with the state-of-the-art text simplification system for Spanish (Simplext). Our results, based on an extensive human evaluation, show that the SMT-based systems perform equally as well as, or better than, Simplext, despite the very small datasets used for training and tuning.
منابع مشابه
Readability Indices for Automatic Evaluation of Text Simplification Systems: A Feasibility Study for Spanish
This paper addresses the problem of automatic evaluation of text simplification systems for Spanish. We test whether already-existing readability formulae would be suitable for this task. We adapt three existing readability indices (two measuring lexical complexity and one measuring syntactic complexity) to be computed automatically, which are then applied to a corpus of original news texts and...
متن کاملSpanish Text Simplification: An Exploratory Study Simplificación de textos en Español: Un estudio explorativo
Text simplification is the process of transforming a text into an equivalent which is more understandable for a target user. We focus on text simplification in the Spanish language and present a corpus-based study of simplification operations. The study has implications for the development of an automatic simplification system.
متن کاملA Hybrid System for Spanish Text Simplification
This paper addresses the problem of automatic text simplification. Automatic text simplifications aims at reducing the reading difficulty for people with cognitive disability, among other target groups. We describe an automatic text simplification system for Spanish which combines a rule based core module with a statistical support module that controls the application of rules in the wrong cont...
متن کاملText Simplification Tools for Spanish
In this paper we describe the development of a text simplification system for Spanish. Text simplification is the adaptation of a text to the special needs of certain groups of readers, such as language learners, people with cognitive difficulties and elderly people, among others. There is a clear need for simplified texts, but manual production and adaptation of existing texts is labour intens...
متن کاملTowards Automatic Lexical Simplification in Spanish: An Empirical Study
In this paper we present the results of the analysis of a parallel corpus of original and simplified texts in Spanish, gathered for the purpose of developing an automatic simplification system for this language. The system is intended for individuals with cognitive disabilities who experience difficulties reading and interpreting informative texts. We here concentrate on lexical simplification ...
متن کامل